Sociolinguistics for Computational Social Science

نویسنده

  • Sali A. Tagliamonte
چکیده

In recent years, a major growth area in applied natural language processing has been the application of automated techniques to massive datasets in order to answer questions about society, and by extension people. Sociolinguistics, which combines anthropology, statistics and linguistics (e.g. Labov 1994, 2001), studies linguistic data in order to answer key questions about the relationship of language and society. Sociolinguists focus on frequency and patterns in linguistic usage, correlations, strength of factors and significance, which together reveal information about the sex, age, education and occupation of speakers/writers but also their history, culture, place of residence, social relationships and affiliations. The findings arising from this type research offer important insights into the nature of human organizations at the global, national or community level. They also reveal connections and interactions, the convergence and divergence of groups, historical associations and developing trends. In this paper, I will introduce sociolinguistic research and the nature of sociolinguistic field techniques and sample design. I will argue that socially embedded data is critical for analyzing and discovering social meaning. Then, I will summarize the findings of several case studies. What does the use of a 3rd singular morpheme -s, as in (1), tell us about the history and culture of a community (Tagliamonte 2012, 2013)? How is quotative be like, (2), spreading in geographic space (Tagliamonte to appear)? What is the mechanism that underlies linguistic change (Tagliamonte & D’Arcy 2009) and by extension cultural trends and projections? 1. The English people speaks with grammar. 2. I was like, “Hey how are you going?” And hes like, “Im fine.” Using sociolinguistic datasets, the answers to these questions have successfully addressed pre-vailing puzzles and offered solutions to real world problems. However this type of research isonly be as good as the quality of the data, the capability of the technologies for extracting andanalyzing what is important, and the relevance of the socially cogent and statistically sound inter-pretations. I will argue that Sociolinguists and Computational Scientists could be powerful alliesin uncovering the complex structure of language data and in so doing, offer unsurpassed insightinto varying human states and conditions. ReferencesWilliam Labov. 1994. Principles of Linguistic Change: Volume 1: Internal Factors. Blackwell.William Labov. 2001. Principles of Linguistic Change: Volume 2: Social Factors. Blackwell. Sali A. Tagliamonte and Alexandra D’Arcy. 2009. Peaks beyond phonology: Adolescence, incrementation, andlanguage change. Language, 85(1):58–108. Sali A. Tagliamonte. 2012. Variationist Sociolinguistics: Change, Observation, Interpretation. Wiley-Blackwell.Sali A. Tagliamonte. 2013. Roots of English: Exploring the History of Dialects. Cambridge University Press.Sali A. Tagliamonte. To appear. System and society in the evolution of change: The view from Canada. InE. Green and C. Meyer, editors, Faces of English. De Gruyter-Mouton.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational Sociolinguistics: A Survey

Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of ”computational sociolinguistics" that reflects this increased interest. We aim to provide a comprehensive overview of CL r...

متن کامل

The Role of Sociolinguistics in Second Language Acquisition

Learning a new language also involves learning a broad system of norms for social relations.This study broadly showed how EFL learners’ speech act is conveyed from their nativecultures when they are communicating in English and demonstrated that there are somepossibilities of cross-cultural misunderstanding when interlocutors are engaged in the speechact of complimenting with native speakers of...

متن کامل

Walls of the Tongue: A Sociolinguistic Analysis of Ursula K. Le Guin’s The Dispossessed

“Good” science fiction, if one may be allowed to propose such a definition, is that which transports its readers out of the banal and ordinary, into the world of the what if? and the alien. Writers of good science fiction naturally differ in their implementation of this, but some have opted to employ the more sophisticated tools of linguistics, constructing exotic alien languages or fragments t...

متن کامل

Extracting Social Power Relationships from Natural Language

Sociolinguists have long argued that social context influences language use in all manner of ways, resulting in lects 1 . This paper explores a text classification problem we will call lect modeling, an example of what has been termed computational sociolinguistics. In particular, we use machine learning techniques to identify social power relationships between members of a social network, base...

متن کامل

Signaling and Simulations in Sociolinguistics

Along with game theory, the emerging science of networks has given us a framework for analyzing social systems plausible to both intuition and implementation. As an interaction structure in computer simulation models, social networks provide a way to envision phenomena like information spread, dialect formation, and language change in a more robust way. In this sense a multitude of sociolinguis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014